Wiktionary: A new rival for expert-built lexicons? Exploring the possibilities of collaborative lexicography
نویسنده
چکیده
With the rise of the Web 2.0, collaboratively constructed language resources are rivalling expert-built lexicons. The collaborative construction process of these resources is driven by what is called the “Wisdom of Crowds” phenomenon, which offers very promising research opportunities in the context of electronic lexicography. The vast number and broad diversity of authors yield, for instance, quickly growing and constantly updated resources. While expert-built lexicons have been extensively studied in the past, there is yet a gap in researching collaboratively constructed lexicons. We therefore provide a comprehensive description of Wiktionary – a freely available, collaborative online lexicon. We study the variety of encoded lexical, semantic, and cross-lingual knowledge of three different language editions of Wiktionary and compare the coverage of terms, lexemes, word senses, domains, and registers to multiple expert-built lexicons. We conclude our work by discussing several findings and pointing out Wiktionary’s future directions and impact on lexicography.
منابع مشابه
GLÀFF, a Large Versatile French Lexicon
This paper introduces GLÀFF, a large-scale versatile French lexicon extracted from Wiktionary, the collaborative online dictionary. GLÀFF contains, for each entry, inflectional features and phonemic transcriptions. It distinguishes itself from the other available French lexicons by its size, its potential for constant updating and its copylefted license. We explain how we have built GLÀFF and c...
متن کاملIWNLP: Inverse Wiktionary for Natural Language Processing
Nowadays, there are a lot of natural language processing pipelines that are based on training data created by a few experts. This paper examines how the proliferation of the internet and its collaborative application possibilities can be practically used for NLP. For that purpose, we examine how the German version of Wiktionary can be used for a lemmatization task. We introduce IWNLP, an openso...
متن کاملTo Exhibit is not to Loiter: A Multilingual, Sense-Disambiguated Wiktionary for Measuring Verb Similarity
We construct a new multilingual lexical resource from Wiktionary by disambiguating semantic relations and translations. For this task, we propose and evaluate an automatic disambiguation method that outperforms previous approaches significantly. We additionally introduce a method for inferring new semantic relations based on the disambiguated translations. Our resource fills the gap between exp...
متن کاملA Study on the Semantic Relatedness of Query and Document Terms in Information Retrieval
The use of lexical semantic knowledge in information retrieval has been a field of active study for a long time. Collaborative knowledge bases like Wikipedia and Wiktionary, which have been applied in computational methods only recently, offer new possibilities to enhance information retrieval. In order to find the most beneficial way to employ these resources, we analyze the lexical semantic r...
متن کاملGrassroots Efforts in Contemporary Urban Mapping: An Analysis of Alternative Uses of Collaborative Platforms
Technologies have started to overlap new virtual communication and information layers on top of the urban physical territory, thus bringing along distinct possibilities of social organization. Regarding this phenomenon and intending to achieve improvement in a great variety of fields from Politics to Urban Planning, the terms of Smart or Digital Cities among others have been adopted, still with...
متن کامل